36 PART 1 Getting Started with Biostatistics
statisticians can describe quantitatively how these random fluctuations behave
using mathematical equations called probability distribution functions. Probability
distribution functions describe how likely it is that random fluctuations will
exceed any given magnitude. A probability distribution can be represented in sev-
eral ways:»
» As a mathematical equation that calculates the chance that a fluctuation
will be of a certain magnitude. Using calculus, this function can be integrated,
which means turned into another related function that calculates the proba-
bility that a fluctuation will be at least as large as a certain magnitude.»
» As a graph of the distribution, which looks and works much like a
histogram.»
» As a table of values indicating how likely it is that random fluctuations will
exceed a certain magnitude.
In the following sections, we break down two types of distributions: those that
describe fluctuations in your data, and those that you encounter when performing
statistical tests.
Distributions that describe your data
Here are some common distributions that describe the random fluctuations found
in data analyzed by biostatisticians:»
» Normal: The familiar, bell-shaped, normal distribution is probably the most
common distribution you will encounter. As an example, systolic blood
pressure (SBP) is found to follow a normal distribution in human populations.»
» Log-normal: The log-normal distribution is also called a skewed distribution.
This distribution describes many laboratory results, such as enzymes and
antibody titers, where most of the population tests on the low end of the
scale. It is also the distribution seen for lengths of hospital stays, where most
stays are 0 or 1 days, and the rest are longer.»
» Binomial: The binomial distribution describes proportions, and represents the
likelihood that a value will take one of two independent values (as whether an
event occurs or does not occur). As an example, in a class held regularly
where students can only pass or fail, the proportion who fail will follow a
binomial distribution.»
» Poisson: The Poisson distribution describes the number of occurrences of
sporadic random events (rather than the binomial distribution, which is for
more common events). Examples of where the Poisson distribution is used in
biostatistics is where the events are not as common, such as deaths from
specific cancers each year.